A Novel Data Partitioning Approach for Association Rule Mining on Grids
نویسندگان
چکیده
Mining association rules refers to extracting useful knowledge from large databases. Algorithms of this technique are both data and computation-intensive, which make grid platforms very attractive for them. However, to exploit these platforms, new data partitioning features are required where the specificities of both association rule mining technique and grids must be taken into consideration. In this paper, we propose a novel data partitioning approach for distributed association rule mining algorithms in the context of a grid computing environment. We conduct experiments using the French research grid ”Grid’5000”. Experimental results confirm that our data partitioning approach is very sufficient for balancing the load when homogeneous clusters are used. For heterogeneous clusters, the proposed data partitioning approach constitute the preprocessing phase of the process of dynamic load balancing of distributed association rule mining.
منابع مشابه
A Novel Method for Selecting the Supplier Based on Association Rule Mining
One of important problems in supply chains management is supplier selection. In a company, there are massive data from various departments so that extracting knowledge from the company’s data is too complicated. Many researchers have solved this problem by some methods like fuzzy set theory, goal programming, multi objective programming, the liner programming, mixed integer programming, analyti...
متن کاملA new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
متن کاملOptimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining
The Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page is one type of web data, was regarded as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...
متن کاملHorizontal vs. Vertical Partitioning in Association Rule Mining: A Comparison
Association rules identify associations among data items and were introduced in [1,2,3]. There are useful rule mining algorithms [4] based on the horizontal partitioning approach. These algorithms partition the database and find frequent itemsets in each partition, and combine the itemsets in each partition to get the global candidate itemsets as well as the global support for the items. In thi...
متن کاملFUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING
The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...
متن کامل